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Abstract 



It is well known that the static caching algorithm that keeps the most frequently requested documents in the cache 
is optimal in case when documents are of the same size and requests are independent and equally distributed. How- 
ever, it is hard to develop explicit and provably optimal caching algorithms when requests are statistically correlated. 
In this paper, we show that keeping the most frequently requested documents in the cache is still optimal for large 
cache sizes even if the requests are strongly correlated. 

Keywords: Web caching, cache fault probability, average-case analysis, least-frequently-used caching, least-recently- 
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1 Introduction 

One of the important problems facing current and future network designs is the ability to store and efficiently deliver a 
huge amount of multimedia information in a timely manner Web caching is widely recognized as an effective solution 
that improves the efficiency and scalability of multimedia content delivery, benefits of which have been repeatedly 
verified in practice. For an introduction to the concept of Web caching, the most recent tutorials, references and the 
latest technology, an interested reader is referred to the Web caching and content delivery Web page [8 1. 

Caching is essentially a process of storing information closer to users so that Internet service providers, delivering 
a given content, do not have to go back to the origin servers every time the content is requested. It is clear that keeping 
more popular documents closer to the users can significantly reduce the traffic between the cache and the main servers 
and, therefore, improve the network performance, i.e., reduce the download latency and network congestion. One 
of the key components of engineering efficient Web caching systems is designing document placement/replacement 
algorithms (policies) that are managing cache content, i.e., selecting and possibly dynamically updating a collection 
of cached documents. 

The main performance objective in creating and implementing these algorithms is minimizing the long-term fault 
probability, i.e., the average number of misses during a long time period. In the context of equal size documents and 
independent reference model, i.e., independent and identically distributed requests, it is well known (see |5 1, Chapter 6 
of ifTSi ) that keeping the most popular documents in the cache optimizes the long term cache performance; throughout 
this paper we refer to this algorithm as static frequency caching. A practical implementation of this algorithm is known 
as Least-Frequently-Used rule (LFU). However, the previous model does not incorporate any of the recently observed 
properties of the Web environment, such as: variability of document sizes, presence of temporal locality in the request 
patterns (e.g., see ||9|, lO, 121, 161, Q and references therein), variability in document popularities (e.g., see O]) and 
retrieval latency (e.g., see 11]). 

Many heuristic algorithms that exploit the previously mentioned properties of the Web environment have been 
proposed, e.g., see Q, 0, |fT3l and references therein. However, there are no explicit algorithms that are provably 
optimal when the requests are statistically correlated even if documents are of equal size. Our main result of this 
paper, stated in Theorem [T] of Section |3] shows that, in the generality of semi-Markov modulated requests, the static 
frequency caching algorithm is still optimal for large cache sizes. The semi-Markov modulated processes, described 
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in Section |2] are capable of modeling a wide range of statistical correlation, including the long-range dependence 
(LRD) that was repeatedly experimentally observed in Web access patterns; these types of models were recently used 
in ifTOl and their potential confirmed on real Web traces in f9l. In Section]?] under mild additional assumptions, we 
show how our result extends to variable page sizes. Our optimality result provides a benchmark for evaluating other 
heuristic schemes, suggesting that any heuristic caching policy that approximates well the static frequency caching 
should achieve the nearly-optimal performance for large cache sizes. In particular, in conjunction with our result 
from [ilOj . we show that a widely implemented Least-Recently-Used (LRU) caching heuristic is, for semi-Markov 
modulated requests and generalized Zipf's law document frequencies, asymptotically only a factor of 1.78 away from 
the optimal. Furthermore, similar results can be expected to hold for the improved version of the LRU caching, termed 
Persistent Access Caching, that was recently proposed and analyzed in [12.1 . 

2 Modeling statistical dependency in the request process 

In this section we describe a semi-Markov modulated request process. As stated earlier, this model is capable of 
capturing a wide range of statistical correlation, including the commonly empirically observed LRD. This approach 
was recently used in 1 10|, where one can find more details and examples. 

Let a sequence of requests arrive at Poisson points {t„, ~oo < n < oo} of unit rate. At each point t„, we use 
Rn G {1,2,..., N}, to denote a document that has been requested, i.e., the event {_R„ — i} represents a request for 
document i at time t„; we assume that the sequence {i?„} is independent of the arrival Poisson points {t„} and that 
F[Rn = i] > for all i and P[i?„ < oo] = 1. 

Next, we describe the dependency structure of the request sequence {Rn}- We consider the class of finite-state, 
stationary and ergodic semi-Markov processes J, with jumps at almost surely strictly increasing points {T„, — oo < 
n < oo}. To < < Ti. Let process {Jt„, — oo < n < oo} be an irreducible Markov chain that is independent of 
{t„}, has finitely many states {1, . . . , M} and transition matrix {pij}- Then, we construct a piecewise constant and 
right-continuous mot/M/a/wg process {Jt} such that 

Jt = JT„, ifr„<i<T„+i; 

for more details on the construction of process Jf , i e R see Subsection 4.3 of 1 10|. Let tt,, = P[Jt = r], 1 < r < M, 
be the stationary distribution of J and, to avoid trivialities, we assume that min.r tt^ > 0. For each 1 < ?' < M, let 

(r) (r) 

Qi ,i ^ i N < oo, be a probability mass function, where ' is used to denote the probability of requesting 
item i when the underlying process J is in state r. Next, the probability law of {Rn} is uniquely determined by the 
modulating process J according to the following conditional distribution, 

n 

mi^^iA<l<n\Jt,0<t<T^]^Y[ql'^'\ n>l, (1) 

1=1 

i.e., the sequence of requests i?„ is conditionally independent given the modulating process J. Given the properties 
introduced above, it is easy to conclude that the constructed request process {Rn} is stationary and ergodic as well. 
We will use 

M 
r=l 

to express the marginal request distribution, with the assumption that qi > {) for alH > 1. In addition, assume that 
requests are enumerated according to the non-increasing order of marginal request popularities, i.e., qi > g2 > • • • • 

In this paper we are using the following standard notation. For any two real functions a{t) and h{t) and fixed 
to G K U {oo} we use a{t) ~ bit) as t ^ to to denote Wmt^tg [o-{t)/b{t)] = 1. Similarly, we say that a{t) > b{t) as 
t to if Mminit^ta a{t) /b{t) > 1; a{t) ^ b{t) has a complementary definition, i.e., limsupj^^^^ a(t)/fe(t) < 1. 

Throughout the paper we will exploit the renewal (regenerative) structure of the semi-Markov process. In this 
regard, let {%}, % < < Ti, be a subset of points {T„} for which Jt„ = 1. Then, it is well known that {%} 
is a renewal process and that sets of variables {Jt,Tj < t < T^+i} are independent for different j and identically 
distributed, i.e., {%} are regenerative points for { Jt}. Furthermore, the conditional independence of {Rn} given {Jt}, 
implies that {%} are regenerative points for i?„ as well. 
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Next we define TZr{u,t), 1 < r < M, to be a set of distinct requests that arrived in interval [u,t), u < t, and 
denote by Nr{u, t), 1 < r < M, the number of requests in interval [u, t) when process Jt is in state r. Furthermore, 
let N{u, t) ~ Ni{u,t) + ■ ■ ■ + N]\j{u, t) represent the total number of requests in [u, t); note that N{u, t) has Poisson 
distribution with mean t — u. 

The following technical lemma will be used in the proof of the main result of this paper. 

Lemma 1 For the request process introduced above, the following asymptotic relation holds 

P[i e 7^(rl, Tz)] - q,¥.[T2 - Ti] as i -> oo, (2) 
where TZ{u, t) = TZi {u,t) U ■ ■ ■ U TZm {u, t). 

Proof: Given in Section |5] O 



3 Caching policies and the optimality 



Consider infinitely many documents of unit size out of which x can be stored in a local memory referred to as cache. 
When an item is requested, the cache is searched first and we say that there is a cache hit if the item is found in the 
cache. In this case the cache content is left unchanged. Otherwise, we say that there is a cache fault/miss and the 
missing item is brought in from the outside world. At the time of a fault, a decision whether to replace some item from 
the cache with a missing item has to be made. We assume that replacements are optional, i.e., the cache content can 
be left unchanged even in the case of fault. A caching algorithm represents a set of document replacement rules. 

We consider a class of caching algorithms whose information decisions are made using only the information of 
past and present requests and past decisions. More formally, let Cf = C^{x) be a cache content at time t under 
policy TT. When the request for a document i?„ is made, the cache with content is searched first. If document 
Rn is already in the cache (i?„ G ), then we use the convention that no document is replaced. On the other hand, 
if document i?„ is not an element of C^^, then a document to be replaced is chosen from a set C^^ U {R„} using 
a particular eviction policy. At any moment of request, t„, the decision what to replace in the cache is based on 

i?2, • ■ ■ , Rn, C*^, , C^j, • ■ ■ , C^^. Note that this information already contains all the replacement decisions made 
up to time r„. This is the same information as the one used in the Markov decision framework [SJ. 

The set of the previously described cache replacement poUcies, say Vc, is quite large and contains mandatory 
caching rules (more typical for a computer memory environment), i.e., those rules that require replacements in the case 
of cache faults. Furthermore, the set Vc also contains the static algorithm that places a fixed collection of documents 

= C in the cache and then keeps the same content without ever changing it. 

Now, define the long-run cache fault probability corresponding to the policy tt E Vc and a cache of size x as 



E 



P{tt, x) = limsup ■ 



e[o,T] 



T 



(3) 



recall that EA^(0, T) = T. Note that we use the limsup in this definition since the limit may not exist in general and 
that, as defined before, C^^ = C"^ [x] is a function of x and we suppress it from the notation. 



Next, we show that 



E 



P(7r, x) = limsup ■ 

k — >OQ 



(4) 



Eiv(o,rfc) 

where Tk are the regenerative points, as defined in the previous section. Note that estimating the previous expression is 
not straight forward since replacement decision depends on all previous requests, i.e., it depends on the past beyond the 
last regenerative point. To this end, for the lower bound, for any < e < 1, let fc = k{T, e) = [r(l — e)/E[72 — Tl] J , 
where \ u\ is the largest integer that is less or equal to u. Then, note that 



1 



-E 



r„e[0,T] 



> E 



> 



1\%<T] 



T 



-E 



i[Tk > r 



7V(0,T) 
f 



(5) 
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Next, using the Weak Law of Large Numbers for P[7^. > T] ^ (as T ^ oo) and the fact that (0, T) is Poisson 
with mean T in the preceding inequality, we obtain 



E 



P{Tr,x) > (1 — e) limsup 

,_\T(l-j) I 
''-[E[r2-Til J 



E 



EN{0,%) 



(1 — e) limsup ■ 

k — >oo 



E7v(o,rfe) 



since the set {k : k = — e)/E[72 — Ti]\,T > 0} covers all integers. We complete the proof of the lower 

bound by passing e ^ 0. The upper bound uses similar arguments where, in this case, k is defined as fc = k{T, e) = 
[T(l + e)/E[r2 - Ti]], and P(7r, x) is upper bounded as 



1 

-E 
T 



r„6[0,T] 



< E 



l[T<Tk]- 



-E 



l[T>%] 



jV(o,r) 



Then, similarly to earlier arguments, we derive the corresponding upper bound for P{tt, x) in (|4|i. 

Next, observe the static policy s, where C^^ = {1, 2, . . . , x} for every n. Then, due to the ergodicity of the request 
process, the long-run cache fault probability of the static policy is 

Psix)^P{s,x)^J2l^- 

Since the static policy belongs to the set of caching algorithms Vc, we conclude that 



Psix) > inf P{tt,x). 



(6) 



Our goal in this paper is to show that for large cache sizes x there is no caching policy that performs better, i.e., 
achieves long-term fault probability smaller than Ps{x). This is stated in the following main result of this paper 

Theorem 1 For the semi-Markov modulated request process defined in Section^ the static policy that stores docu- 
ments with the largest marginal popularities minimizes the long-term cache fault probability for large caches, i.e.. 



inf P{tt, x) ^ Ps{x) as X ^ oo. 



(7) 



Remarks: (i) From the examination of the following proof it is clear that the result holds for any regenerative request 
process that satisfies Lemma[T] (ii) Though asymptotically long-term optimal, the static frequency rule possesses other 
undesirable properties such as high complexity and lack of adaptability to variations in the request patterns. However, 
its optimal performance presents an important benchmark for evaluating and comparing widely implemented caching 
policies in the Web environment. On the other hand, it is a question whether a widely accepted analysis of the cache 
miss ratio is the most relevant performance measure to analyze. A strong argument in support to this choice is that 
other measures would be harder (sometimes impossible) to analyze. However, in SectionlH we present some possible 
extensions of our results to the analysis of other objective functions, such as long-run average delay of fetching 
documents not found in the cache, or long-run average cost of retrieving documents outside of the cache, etc. (iii) 
Note that the condition qi, i > 1, given in the previous section makes the problem of proving asymptotic optimality 
nontrivial. In case qi > for just a finite number of z's, the document population would be finite and the result 
above would be trivially true, (iv) The preliminary version of this work was presented in the Workshop on Analytic 
Algorithms and Combinatorics (ANALCO'2006), Miami, Florida, Januai-y 2006. 
Proof: In view of (|6]l, we only need to show that inf^g-p^ Pi^^, x) > Ps{x) as a: — > oo. 

For any set A, let |^| denote the number of elements in A and A \ B represent the set difference. Then, it is easy 
to see that the number of cache faults in [t, u), t < u, is lower bounded by \TZ{t, u) \ \ since every item that was not 
in the cache at time t results in at least one fault when requested for the first time; in particular, if t — Tj, u 



J2 iK^c;j > |7^(7^■,7;■+l)\c^J. 



(8) 
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This inequality and ^ results in 



k-l 



P{tt, x) > limsup J2 T^+i) \ Cf^. |]. (9) 

Now, since we consider caching policies where replacement decisions depend only on the previous cache contents and 
requests, due to the renewal structure of the request process we conclude that for every j > 1 and all i > 1, events 
{i E TZ{Tj,Tj+i)} and {i e } are independent and, therefore, for every j > 1, 



E 



|7^(7^■,7^■+l)\CfJl[C?^ -C]J =^P[^e7^(T,■,7^■+l),^^C]P[Cf^. = C] 

i>i 

= ¥[Cl = C]J2n^ e 7e(7^-,7;-+i)]l[z ^ C] 



i>i 

> P[C?^. = C] M e 7^(7^■,7;■+l)]. 



Then, after summing over all values of C, for any j > 1 we obtain 

E[|7^(7^■,7;■+l)\CfJ]> inf e TZ{T, ,T,+,)]. (10) 

Next, we show that the cache content C ^ [l.x] ^ {1, . . . ,x} achieves the infimum in the previous expression for 
large cache sizes. This is equivalent to proving that, as a; — > oo. 

We will justify the previous statement by showing that for any set C obtained from [1, x] by placing documents from 
the set + 1, ... } instead of those in [1, x] can not result in J2i^c '^I* ^ ''^(Tj j Tj+i)] < (1 ~ ^) J2i<^[i x] ^{"^ ^ 
TZ{Tj,Tj+i)] for large cache sizes x and any < e < 1. 

Lemma[T]implies that for an arbitrarily chosen e > there exists finite integer io such that for all i > io 

(1 - e)q^E[TJ+l - Tj] < P[i e TZ{Tj,Tj+i)] < (1 + e)q,E[Tj+i - T^]. (12) 

Thus, using the previous expression and qi \, as i ^ oo, we conclude that for all k < io there exists xq > io, such 
that for all i > xo 

min ¥[k e 7^(7:■,7:■+l)] > F[i e 7^(7:■, Tl+i)]. (13) 

l<k<iQ 

Now, assume that the cache is of size x > xq and observe different cache contents C obtained from [1, x] by replacing 
its documents with items from {x + l,x + 2, . . .}. Next, using ( fTST l. we conclude that replacing documents enumerated 
with {1, . . . , io} can only increase the sum on the left-hand side of (fTTl i. On the other hand, observe cache contents 
C that are obtained from [1, x] by replacing documents enumerated as {io + 1, . . . , with items from {x + 1, . . .}. 
Then, it is easy to see that proving inequality ( fTTT ) is equivalent to showing that J2ie[io+i x] ^I* ^ ^('^i^+i)] ^ 
(1 — e)X]iec\[i io] ^ ^("^ iTj+i)]' ^'^y < e < 1. Next, since for any i > io inequalities ( fT2] i hold, we 
conclude 

E.g[.o+l,.]IP[»e7^(7^^,?^^+l)] ^ (i-e)E.s[.o+i,.]g» ^1-6 

E.ec\[MolP[*e7^(7^■,r,+l)] - (l + e)E.ec\[M„]9» " 1 + ^' 
where the second inequality in the previous expression follows from the monotonicity of qiS. Then, by passing e ^ 
we prove inequality (fTTI) . 

Note that after applying the lower bound ( fTTT ) in ( fTOl ), in conjunction with (|9]l, the renewal nature of the regenerative 
points and Lemma[Tl we obtain that, as x oo, 

inf P(7r,x) > V<7„ (14) 
which completes the proof of the theorem. O 
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4 Further extensions and concluding remarks 



In this paper we prove that the static frequency rule minimizes the long term fault probability in the presence of 
correlated requests for large cache sizes. 

There are several generalizations of our results that are worth mentioning. First, the definition of the fault probabil- 
ity in (|4|l can be generalized by replacing terms l[i?„ ^ ] with /(_R„)l[i?„ ^ ], where f{i) could represent the 
cost of retrieving document i, e.g., the delay of fetching item i not found in the cache. Assume that < /(«) < A' < oo 
and let 5 be a set of x items such that qif{i) > Qjfij) for alH e 5 and j ^ S. Then, the following result holds: 

Theorem 2 For the semi-Markov modulated request process defined in Section |2] the static caching policy C = S 
minimizes the long-run average cost function /(•) (e.g., delay) for documents not found in the cache. 

Sketch of the proof: The proof of this theorem follows completely analogous arguments to those used in the proof of 
Theorem[ri and, in order to avoid repetitions, we outline its basic steps. 

Similarly as in (O, the long-run average cost for documents not found in the cache that corresponds to the caching 
policy TT e T'c is defined as 



I?(7r, x) = lim sup ■ 

T->oo 



E 



T 



Then, by using similar arguments to (|4|l - (|6l) and < f{i) < K < oo, z > 1, we obtain that the long-run average cost 
of the static policy Cr„ = S, n > 1, for the cache with size x satisfies 



D,ix)=y2fiz)q,> inf Din, 



Next, in order to prove 



_Ds(a;) < inf D(tt,x) as a; — > oo. 



(15) 



(16) 



similarly as in the proof of Theorem [T] we lower bound the number of cache misses, and, therefore, the average cost 
in every regenerative interval [Tj , T^+i ), j > 1, as 

f{Rn)mn ^ CI] > ^ e 7^(7^■,7^■+l),^ ^ CrJ. 

Next, since we consider caching policies whose replacement decisions depend only on the past cache contents and 
requests, due to the renewal structure of the request process, we conclude that for any j > 1, 



E 



J2 fiRn)mn^c:ji[ci=c] 

and, thus, similarly as in ( fTOl l, we obtain 



J2 f{Rn)l[Rn^C:j 



Now, given the previous observations, the asymptotic inequality ( fT6l ) is proved using analogous arguments to those in 
( fT2] i - ( fT4b . Note that in the context of this result, inequality ( fT3] l becomes 

mm /(fc)P[fc e 7^(7^■,7^■+l)] > /WP[z G n{T,,T,+,)] 

1<K<20 

for all i > xq, and we have analogous asymptotic linearity as in ( fT2] ) since | as i — > cx) and < f{i) < K < oo. 
Finally, as in ( fTST l - ( fT4b . the rest of the proof is based on proving that no replacements of documents in the set S 
can lead to smaller long-run average delays for large cache sizes x. Thus, the asymptotic bound ( fTSI l holds and, in 
conjunction with ( fTSl ). completes the proof of the theorem. O 



In addition to the previous generalization, in the context of documents with different sizes, one can prove the 
following result: 
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Theorem 3 Assume that documents have different sizes and that they are enumerated according to the non-increasing 
order of qi/si, i.e., gi/si > qij > where Si is the size of document i and Si £ {si,...,sd}, where 

si, . . . , S£) < oo and D < oo. Then, for the semi-Markov modulated request process defined in Section |2] ;/ 
Sj">i 1j ~ Sj>i 1j OS i oo, i.e., X]j>i 9j long-tailed, the static rule that places documents with the small- 
est index in the cache, subject to the constraint '^iSi < x, is asymptotically optimal. 

Proof: In light of the identical arguments to those used in the proof of Theorem[T] it is not hard to show that the static 
policy minimizes the long run average number of misses for large cache sizes. More specifically, the optimal long-run 
cache fault probability is of the form 

where Mx is the set of document indices that minimizes ( fTTI l subject to the constraint X^j^AA — The previous 
problem is a knapsack problem (see Section 5.2 of 1 16| for further explanations). It is shown that in the case where 
objects can be split to exactly fill the knapsack, the policy that minimizes ( fTTl ) is the one that places documents with 
the largest q-J Si values in the cache until an object, say rix, fails to fit. Then, the optimal solution is to split document 
Ux to fill the cache completely. Since that is not possible in the case of document caching, it is not hard to see that 
the fault probability for the optimal static placement in our case is between the optimal fault probabilities in the case 
of cache sizes X]"=i ^ Y^l=i (note that X^T^T^ Si < x < X"=i ^i)- Now, since Ux monotonically increases 

as X increases, in conjunction with the long-tailed assumption of the theorem, Xi>n ^ Xi>n <?i as a; ^ oo, we 
conclude the proof of the theorem. O 

Finally, in light of our recent result on the asymptotic performance of the ordinary LRU caching rule in the presence 
of semi-Markov modulated requests and Zipf's law marginal distributions {qi ~ c/z" as i ^ oo, c > 0) obtained in 
Theorem 3 of 1 10|, asymptotic optimality of the static frequency rule implies that the LRU is factor e'^ w 1.78 away 
from the optimal (7 is the Euler constant, i.e. 7 w 0.57721 . . .). Therefore, in view of other desirable properties, its 
self-organizing nature and low complexity, the LRU rule has excellent performance even in the presence of statistically 
correlated requests. Furthermore, stronger optimality conclusions could be drawn for the recently proposed versions 
of the LRU policy (see [11] and 1 12 1), given that the performance analysis of these algorithms can be extended to the 
correlated setting such as the one in [10|. 



5 Proof of Lemma [D 



In this section, we prove the asymptotic relation dU stated at the end of Section|2] 
Note that 

P[i € 7e(Ti,T2)] = 1 - P[i ^ 7ei(Ti,T2), . . . ,i ^ nM{TuT2, 



-,(l)\JVi 



(1 



-.iM)^N^ 



(18) 



where Nr Nr{Ti,T2), 1 < r < M. Then, since ^ as i ^ cx) and min^ tt^ > 0, it follows that qf^ as 
i —* 00, 1 < r < M. In addition, 1 — e^^ < x for all a; > and for any 1 > e > 0, there exists xo(e) > 0, such that 
for all < X < xo{e) inequality 1 — a; > e^^'^+^' holds, and, therefore, for i large enough 



E 



1 



,-(g<'>iVi+-+9f'"A'M) 



< 
< 



l-{l-qi'Y^...il~q 

I _ g-(l+e)(9r'A'l+---+?f"jVA/) 



(19) 



Then, since 1 



< X for a; > 0, we obtain, for i large enough. 



E 



1 ^ e~{l+<^KN,q\'^+-+NMqi'''>) 



< (1 + e)IE 



q^^N, + --- + ql''^NM 



(20) 



Next, let ^ A^i 
Jf, it follows that 



Nm- Then, we show that 'ETV; 



q^^^-'EA^Af — <ZiEiV. From the ergodicity of 



Vt = = 



E[r2 - Ti] ' 
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where Tir, 1 < r < M, is the length of time that Jt spends in state r during the renewal interval {Ti,T2) (see 
Section 1.6 of H). Finally, using EN = E[T2 - Ti] and EA^,. = ETi^, 1 < r < M (Poisson process of rate 1), in 
conjunction with ( [20] i. we conclude, for i large 



E 



1 _ ^~{l+<^){Niql'^+-+NMql"'>) 



< (l + e)<7,E[T2-Ti]. 



Next, we estimate the lower bound for the left hand side in ( |T9] l. After conditioning, we obtain 



E 



1 



> 



1 



(21) 



(22) 



where qf "^ < qi — qij min^ Hr < Hqi, 1 < < Af, and some large enough constant < < oo . Then, note that 
for every lli e {A^ < q^ ^1,9, A'^i + • • • + VVa/ < = V^- ^'^ addition, for any 1 > e > 0, there exists 
> 0, such that for all < a: < inequality 1 — e^^ > (1 — holds and, therefore, for i large enough such that 



1 - e-(^^«»^ 



> (l-e)E (iVigf^ + ...+iVM<zf'^)l 



> (1 - e)q,E[T2 - Ti] - (1 - e)q,E 



Nl 



N <q,^ 



Then, since EA^ < oo and Xj \fqi ^ oo as i ^ oo, it is straightforward to conclude that E[Afl[iV > 
EiV — E[iVl [A^ < 1/ \fqi^ ^ as i — > cx), and, therefore, in conjunction with (l22l) . we obtain 



> (l-e)<z,E[T2-ri], 



as I — > oo. Finally, after letting e ^ in the previous expression and ( |2TI ). we complete the proof of this lemma. O 
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